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Abstract 

We study the calibration process in circular ultrasound tomography devices where the sensor positions 
deviate from the circumference of a perfect circle. This problem arises in a variety of applications in 
signal processing ranging from breast imaging to sensor network localization. We introduce a novel 
method of calibration/localization based on the time-of-flight (ToF) measurements between sensors when 
the enclosed medium is homogeneous. In the presence of all the pairwise ToFs, one can easily estimate 
the sensor positions using multi-dimensional scaling (MDS) method. In practice however, due to the 
transitional behaviour of the sensors and the beam form of the transducers, the ToF measurements for 
close-by sensors are unavailable. Further, random malfunctioning of the sensors leads to random missing 
ToF measurements. On top of the missing entries, in practice an unknown time delay is also added to the 
measurements. In this work, we incorporate the fact that a matrix defined from all the ToF measurements 
is of rank at most four. In order to estimate the missing ToFs, we apply a state-of-the-art low-rank matrix 
completion algorithm, OptSpace . To find the correct positions of the sensors (our ultimate goal) we 
then apply MDS. We show analytic bounds on the overall error of the whole process in the presence of 
noise and hence deduce its robustness. Finally, we confirm the functionality of our method in practice 
by simulations mimicking the measurements of a circular ultrasound tomography device. 

EDICS Category: SAM-CALB, BIO-SENS, SAM-IMGA, SEN-LOCL 

I. Introduction 

In most applications involving sensing, finding the correct positions of the sensors is of crucial 
importance for obtaining reliable results. This is particularly true in the case of inverse problems which 
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can be very sensitive to incorrect sensor placement. This requirement can be satisfied in two ways; One 
might put the effort in the construction of the instruments and try to place the sensors exactly in the 
desired positions, or use a method to find the exact positions after the construction of the device. In this 
work we will consider the latter and we call the procedure of obtaining the sensor positions calibration. 
Note that even in the former case, due to the precision of the construction instruments, a calibration is 
needed afterwards for determining the exact sensor positions. Although in rare cases a single calibration 
might be enough throughout the lifetime of the measurement system, it is of great use to have a calibration 
procedure which can be repeated easily and with low cost. 

This work focusses on the calibration problem in circular sensing devices, in particular, the ones 
manufactured and deployed in O, |3fl. These devices consist of a circular ring surrounding an object and 
scanning horizontal planes. Ultrasound sensors are placed on the interior boundary of the ring and act 
both as transmitters and receivers. 

The calibration problem we address in this paper is the following; In the circular tomography devices, 
the sensors are not exactly placed on a perfect circle. This uncertainty in the positions of the sensors acts 
as a source of error in the reconstruction algorithms used to obtain the characteristics of the enclosed 
object. We aim at finding a simple method for calibrating the system with correct sensor positions with 
low cost and without using any extra calibrating instrument. 

In order to find the correct sensor positions, we incorporate the time-of-fiight (ToF) of ultrasound 
signals between pairs of sensors, which is the time taken by an ultrasound wavefront to travel from a 
transmitter to a receiver. If we have all the ToF measurements between all pairs of sensors when the 
enclosed medium is homogeneous, then we can construct a ToF matrix where each entry corresponds to 
the ToF between each pair of sensors. We can infer the positions of the sensors using this ToF matrix. 

To obtain reliable ToF entries appropriate for our purpose, we assume that no object is placed inside 
the ring during the calibration phase and prior to actual measurements. There are a number of challenges 
we are encountering in this work, namely, 

• the ToF matrices obtained in a practical setup have missing entries. 

• the measured entries of the ToF matrices are corrupted by noise. 

• there is an unknown time delay added to the measurements. 

If one had the complete and noiseless ToF matrix without time delay, the task of finding the exact 
positions would be very simple. This problem is addressed in literature as multi-dimensional scaling 
(MDS) 141]. Unfortunately, the ToF matrix in practical setups is never complete and many of the time-of- 
fiight values are missing. The missing entries can be divided into two categories; structured missing entries 
caused by inability of the sensors to compute the mutual time-of-fiights with their close-by neighbors, 
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Fig. 1. Block diagram for the calibration procedure prior to ultrasound tomography. The incomplete distance matrix is passed 
through the OptSpace algorithm which denoises it, estimates the missing entries and removed the unknown time delay. The 
calibration is finished then by applying the MDS algorithm on the completed matrix which estimates the actual sensor positions. 



and random missing entries due to malfunctioning of the sensors or the ToF estimation algorithm during 
the measurement procedure. 

A good estimation of the positions of the sensors can be obtained, if we have a good estimation of 
the missing entries of the ToF matrix. In general, it is a difficult task to infer missing entries of a matrix. 
However, it has recently been established that if the matrix is low rank, a small random subset of its 
entries permits an exact reconstruction (5D. Since a modified version of the ToF matrix (when the entries 
are squared ToF measurements) is low rank, its missing entries can be accurately estimated using matrix 
completion algorithms. To this end, we use OptSpace , a robust matrix completion algorithm developed 
by Keshavan et al. 

On top of the missing entries, we also need to deal with an unknown time delay. This delay it due to 
the fact that in practice, the impulse response of the piezoelectric and the time origin in the measurement 
procedure are not known, and this causes an unknown time delay which should then be added to the 
measurements. To infer this time delay simultaneously with the positions of the sensors, we propose a 
heuristic algorithm based on OptSpace. 

In circular setups, the sensors are not necessarily on a circle and deviate from the circumference which 
in fact motivates the calibration problem. We therefore need to assume that they are in the proximity 
of a circle (the precise statement is given later) and we are required to find the exact positions. Our 
approach is to estimate the local and random missing pairwise distances from which we can then infer 
the positions. As we have already mentioned, we show that a modified version of the ToF matrix has 
rank at most four, and using this property, we propose our calibration procedure. The block diagram 
shown in Fig. [j] summarizes the procedure. 
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A. Related work 

Calibration for circular tomography devises is a variant of sensor localization, a problem that has been 
extensively studied for the past decade |7i |8|]. In sensor localization, given the local connectivity (i.e., 
which sensors are in the communication range of which others), the objective is to devise an algorithm 
that can infer the global position of the sensors. In practice, several methods are deployed as a means 
of obtaining this local information: the Signal Strength |9j], the Angle of Arrival (AOA) lldfl . and the 



Time Difference of Arrival (TDOA) Hill . Our problem is naturally related to sensor localization when 
estimated TDOAs are used to measure the pairwise distances between nearby nodes. One should note that 
due to energy constraints, each node has a small communication range compared to the field size they 
are installed. As a result, only nodes within the communication range of each other can communicate 
and hence estimate their pairwise TDOAs. This situation is depicted in Fig. |2l 

In our problem, however, the local connectivity is precisely the kind of information that is missing. 
In fact, the beam width of transducers and the transition of ultrasound sensors disallow us from having 
reliable ToF's for nearby sensors (see Section III-Bb . For this reason, in practice, the ToF's for close-by 
sensors are discarded and no information regarding their pairwise distances can be deduced. Consequently, 
in our scenario we are faced with a different setting from that of sensor localization; namely, 

• the pairwise distances of neighboring sensors are missing, 

• only the pairwise distances of faraway sensors can be figured out from their ToF's. 

This situation is demonstrated in Fig. |2] By comparing these two scenarios in Fig. [2l one can think of 
the calibration problem for ultrasound sensors as the dual problem of sensor localization. As a result, 
all sensor localization algorithms that rely on local information/connectivity are doomed to fail in our 
scenario. To confirm this fact, in Section [VlIT] through numerical simulations we compare the performance 
of our proposed method with the state-of-the-art algorithms for sensor localization applied in our setting. 



The first sensor localization algorithm we consider is Mds-Map 11211 . This algorithm has two phases. 
First, the Euclidean distance of far off sensors (i.e., the ones that are not in each other's communication 
range) are approximated by the shortest path between them. It was recently shown that having local 



connectivity, the shortest path is a reliable estimate of far off sensors [13]. Second, to estimate the 
relative positions of sensors, multidimensional scaling is applied to the approximated distance matrix. 
However, one can easily see that given faraway sensor's distances, the shortest path is a very coarse 
estimate of the distance between the close-by sensors. This makes Mds-Map perform very poorly in 
our setting. 

One of the most prominent algorithms for centralized sensor localization is based on semi-definite 
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Fig. 2. In sensor localization (left figure) the local connectivity information is available and faraway ones are missing whereas 
in calibration (right figure) the opposite is true. 



programming (Sdp). The method was first introduced by Biswas et al. in H 14TI and solves the sensor 



localization problem using convex relaxation. From a practical point of view, the major problem of Sdp 



based methods is their heavy computations. According to 11411 . the sensor localization for more than 200 



sensors is computationa 



ly prohibitive. Theoretical guarantees of such methods were provided recently 
by Javanmard et al. Il 1 5f1 . As their results suggest, in the case of sensor localization, once the number 
of sensors grow, one cannot reduce the error of semidefinite programming below a threshold unless one 
increases the communication range and hence the power consumption of sensors. We will show, however, 
using the matrix completion, the error decreases as the number of transmitter/receivers grows. 

In the core of our proposed method is matrix completion, the problem that aims to recover a low rank 
matrix from its randomly known entries. It is easy to show that a matrix formed by pairwise distances 
is low rank (see Lemma []]). Based on this property, Drineas et al. suggested using matrix completion 
for inferring the unknown distances Jlrjj . However, their analysis relies on the assumption that even for 
faraway nodes, there is a nonzero probability of communication. This assumption severely restricts the 
applicability of their result in practice. Thinking back to duality between sensor localization and our 
problem, this assumption suggests that in our case the pairwise distances of nearby transmitters/receivers 
can be obtained with a nonzero probability, an assumption that does not hold. Fortunately, in the past 
two years, there has been many improvements on the matrix completion. Candes et al. showed that a 
small random fraction of the entries suffices to reconstruct a low rank matrix exactly. In a series of 
papers 17, 181. Keshavan et al. studied an efficient implementation of a matrix completion algorithm 



so called OptSpace and showed its optimality. Furthermore, they proved that their algorithm is robust 
Jl 

against noise In view of this progress, we were able to show that OptSpace is also capable of 



finding the missing nearby distances in our scenario and hence provide us with their corresponding ToFs. 
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b^the best of our knowledge, all the above work as well as the recent matrix completion algorithms 
2ol only deal with the random missing entries. However, in our case, we are encountered with the 



[19 



structured missing entries in addition to random ones (see Section III-BI ). an aspect that was absent from 
the previous work. Therefore, one of our contributions is to provide analytic bounds on the error of 
OptSpace in the presence of structured missing entries. 

The organization of this paper is as follows; In Section JIJ we define the model used in circular 
tomography and introduce the tools used for calibration in such a setup. In Section JIIJ we present 
the mathematical basis for the problem. In Sections JV] and |V] an overview of matrix completion and 
multidimensional scaling methods is provided. Then in Section [VT] our main results for calibration are 
presented. Section IVIII contains the proofs for the main results and finally Section I VIII I is devoted to the 
simulation results. 

II. Circular Time of Flight Tomography 

The focus of this research is ultrasound tomography with circular apertures. In this setup, n ultrasound 
transmitters and receivers are installed on the interior edge of a circular ring and an object with unknown 
acoustic characteristics is placed inside the ring. At each time instance a transmitter is fired, sending 
ultrasound signals with frequencies ranging from hundreds to thousands of kHz, while the rest of the 
sensors record the received signals. The same process is repeated for all the transmitters. Each one of n 
sensors on the ring is capable of transmitting and receiving ultrasound signals. The aim of tomography 
in general is to use the recorded signals in order to reconstruct the characteristics of the enclosed object 
(e.g. sound speed, sound attenuation, etc.). The general configuration for such a tomography device is 
depicted in Fig. [3] Employing these measurements, an inverse problem is constructed, whose solution 
provides the acoustic characteristics of the enclosed object. 

There are two common methods for solving the inverse problem. The solutions are either based on 

I 22L Both techniques consist of forward modeling the 



the wave equation 112 1H or the bent-ray theory 



problem and comparing the simulation results with the measured data. For the details see 112 ill and 12211 . 



Nevertheless, in both cases, in order to simulate the forward model and rely on the recorded data, a very 
precise estimate of the sensor positions is needed. In most applications (e.g., 1 23h 25 it is assumed that 
the sensors are positioned equidistant apart on a circle and no later calibration is performed to find the 
exact sensor positions. The main objective of this paper is to estimate the precise positions of the sensors. 

A. Homogeneous Medium and Dimensionality Reduction 

In order to estimate sensor positions, we utilize the ToF measurements for a homogeneous medium 
(e.g. water in the context of breast cancer detection). Let's assume that the mutual ToFs are stored in a 
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Fig. 3. Circular setup for ultrasound tomography considered in this work. Ultrasound transducers are distributed on the edge of 
a circular ring and the object with unknown characteristics is put inside. Transmitters and receivers are collocated. Transducers 
are fired each in turn while the rest of sensors recording the ultrasound signals reaching them. In practice, the positions deviate 
from an ideal circle. 



matrix T. In a homogeneous medium, entries of T represent the time travelled by sound in a straight 
line between each pair of a transmitter and receiver. 

Knowing the temperature and the characteristics of the medium inside the ring, one can accurately 
estimate the constant sound speed cq. Thus, it is reasonable to assume that cq is fixed and known. Having 
the ToFs for a homogeneous medium where no object is placed inside the ring, we can construct a 
distance matrix D consisting of the mutual distances between the sensors as 

D = [d i>j ] = c Q T, T=[t itj ], i,j £ {1, • • • , re} (1) 

where is the ToF between sensors i and j and n is the total number of sensors around the circular 
ring. Notice that the only difference between the ToF matrix T, and distance matrix D, is the constant 
Co- This is why in the sequel our focus will mainly be on the distance matrix rather than the actual 
measured matrix T. 

Since the enclosed medium is homogeneous, the matrix T is a symmetric matrix with zeros on the 
diagonal and so is the matrix D. Even though, the distance matrix D is full rank in general, a simple 
point-wise transform of its entries will lead to a low rank matrix. More precisely, we can prove (see 
Appendix lAl) the following lemma: 

Lemma 1. If one constructs the squared distance matrix D as 

D = [dU , 

then the matrix D has rank at most 4 J4J] and if the sensors are placed on a circle, the rank is exactly 3. 
In practice, as we will explain in the next section, many of the the entries of the ToF matrix (or 
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equivalently the distance matrix) are missing and there is an unknown time delay added to all the 
measurements. 

B. Time of Flight Estimation 

,1 



Several methods for ToF estimation have been proposed in the signal 



processing community 1122 , 



These methods are also known as time-delay estimation in acoustics [27]. In these methods, the received 
signal is compared to a reference signal (ideally the sent signal), and the relative delay between the two 
signals is estimated. Since the sent signal is not available in most cases, the received signal through the 
object is compared to the received signal when the underlying medium is homogeneous. However, this 
assumption is not true in our case. In the calibration phase, we have only signals passed though the 
homogeneous medium. Thus, there is not any reference signal to find the relative time-of-flights. 

Because of the above limitations, we are forced to estimate the absolute ToFs. For this purpose, we 
use the first arrival method. This method probes the received signal and defines the time-of-fiight as the 
time instant at which the received signal power exceeds a predefined threshold. 

In practical screening systems, to record measurements for one fired transmitter, all the sensors are 
turned on simultaneously and after some unknown transition time (which is caused by the system structure, 
different sensor responses, etc.), the transmitter is fed with the electrical signal and the receivers start 
recording the signal. This unknown time may change for each pair of transmitters and receivers. We will 
see that this unknown time delay plays an important role in sensor position estimation. 

The beam width of the transducers and the transition behaviour of the ultrasonic sensors prevent the 
sensors to have a reliable ToF measurement for close-by neighbours. This results in incorrect ToF values 
for the sensors positioned close to each other. Therefore, numbering the sensors on the ring from 1 to n, 
in the ToF matrix T, we will not have measurements on a certain band around the main diagonal and on 
the lower left and upper right parts as well. We call these missing entries as structured missing entries. 
This is illustrated in Fig. 01 The links shown by dashed lines do not contribute in the ToF measurements, 
because the beam for the transmitter does not cover the gray part. 

During the measurement procedure, it may also happen that some sensors do not act properly and 
give outliers. Thus, one can perform a post processing on the measurements, in which a smoothness 
criterion is defined and the measurements not satisfying this criterion are removed from the ToF matrix. 
We address these entries as random missing entries. An instance of the ToF matrix with the structured 
and random effects is shown in Fig. [5J where T nc denotes the incomplete ToF matrix and the gray entries 
correspond to the missing entries. Furthermore, in practice, the measurements are corrupted by noise. 

The above mentioned problems result in an incomplete and noisy matrix T, which cannot be used for 
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Fig. 4. The beam width of the transmitter causes the neighbouring sensors not to have reliable ToF measurements. This is 
shown by dashed lines in the figure. The area shown in gray corresponds to the part which is not covered by the transmitter's 
wave beam. This results in the structured missing entries. 



T = 

mc 



Fig. 5. A sample incomplete ToF matrix with structured and random missing entries. 

position reconstruction, unless the time delay effect is removed, the unknown entries are estimated, and 
the noise is smoothed. 

III. Problem Setting 

We observed that the distance matrix, when the aperture is in homogeneous medium, is calculated as 
in (0Q). We also saw in the previous section that the measurements for the ToF matrix T have three major 
problems : they are noisy, some of them are missing, and the measurements contain some unknown time 
delay. For simplicity, we will assume that this time delay is constant for all the transmitters, namely all 
the transmitters send the electrical signal after some fixed but unknown delay to- Hence, we can rewrite 
the ToF matrix as follows 

f = T + t A + Z , 
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Fig. 6. Sensors are distributed around a circle of radius r with small deviations from the circumference, 
where T consists of ideal measurements for ToF, Zq is the noise matrix and A is defined as 

I otherwise . 

With the above considerations, the distance matrix can also be written as 

D = D + d A + Z, (2) 

where D = cqT, do = coto, and Z = cqZq. 

In our model we do not assume that the sensors are placed exactly on the ring. What happens in 
practice is that the sensor positions deviate from the circumference and our ultimate goal is to estimate 
these deviations or equivalently the correct positions (see Fig. [6]). The general positions taken by sensors 
are denoted by the set of vectors {xi, . . . ,x n }. 

As described earlier, there are two contributions to missing entries. One is the missing measurements 
of close-by sensors, which we call structured missing entries. The other is the missing measurements 
due to random malfunction of sensors, which we call random missing entries. First, to incorporate the 
structured missing entries, we assume that any measurements between sensors of distance less than 5 n are 
missing (see Figure [6]). Hence, the number of structured missing entries depends on ti%. We are interested 
in the regime where we have a small number of structured missing entries per row in the large systems 
limit. Accordingly, typical range of 5 n of interest is S n = 6( ry/\og n/n). A random set of structured 
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missing indices S C [n] x [n] is denned from {xi} and r5 n , by 

S = : dij < S n and i ^ j} , 

where djj = ||iCj — Then, the structured missing entries are denoted by a matrix 

D id if(i,j)e5, 

otherwise . 

Note that the matrix D s = D — D s captures the noiseless distance measurements that is not effected 
by structured missing entries. This way, we can interpret the matrix D s as additive noise in our model. 
Likewise, for the constant additive time delay we can define 

I otherwise , 

where S 1 - denotes the complementary set of 5. Next, to model the noise we add a random noise matrix 



Zij if (i,j)eS ± , 
otherwise . 

We do not assume a prior distribution on Z, and the main theorem is stated for any general noise matrix 
Z, deterministic or random. One practical example of Z is an i.i.d. Gaussian model. 

Finally, to model the random missing entries, we assume that each entry of D s + toCoA s + Z s is 
sampled with probability p n . In the calibration data, we typically see a small number of random missing 
entries. Hence, in order to model it we assume that p n = 0(1). Let E C [n] x [n] denote the subset 
of indices which are not erased by random missing entries. Then a projection Ve '■ M nxn — > M. nxn is 
defined as 

otherwise . 
We denote the observed measurement matrix by 

N E = V E (D s + d A E + Z s ), (3) 

where do = toco is a constant. Notice that the matrix N E has the same shape as T| nc shown already 
schematically in Fig. [5] Now we can state the goal of our calibration problem: 

Given the observed matrix N E and the missing indices 5 U E^, we want to estimate a matrix D 
which is close to the correct distance matrix D. Then by using D we would like to estimate the sensor 
positions. 



October 13, 2011 



DRAFT 



12 



In order to achieve this goal, there are two obstacles we need to overcome. First, we need to estimate 
the missing entries of N E and second, we want to find the sensor positions given approximate pairwise 
distances. The former is done by employing a matrix completion algorithm and the latter by using the 
multidimensional scaling method. 

IV. Matrix Completion 



OptSpace, introduced in 11711 . is an algorithm for recovering a low -rank matrix from noisy data with 
missing entries. The steps are shown in Algorithm Q] Let M be a rank-g matrix of dimensions n x n, 
Z the measurement noise, and E the set of indices of the measured entries. Then, the measured noisy 
and incomplete matrix is M E = Ve(M + Z). 



Algorithm 1 OptSpace 111711 



Input: Observed matrix M E = V E {M + Z). 
Output: Estimate M. 
1: Trimming: remove over-represented columns/rows; 

Rank-g projection on the space of rank-g matrices according to ©; 



Gradient descent: Minimize a cost function F(-) defined in 11711 : 



In the trimming step, a row or a column is over-represented if it contains more samples than twice 
the average number of samples per row or column. These rows or columns can dominate the spectral 
characteristics of the observed matrix M E . Thus, some of their entries are removed uniformly at random 
from the observed matrix. Let M E be the resulting matrix of this trimming step. This trimming step is 
presented here for completeness, but in the case when p n is larger than some fixed constant (like in our 
case where p n = Q(p)), M E -M E with high probability and the trimming step can be omitted. 

In the second step, we first compute the singular value decomposition (SVD) of M E . 

n 

M E = Y J <M E )u l vJ , 
i=i 

where crj(-) denotes the i-th singular value of a matrix. Then, the rank-q projection returns the matrix 

q 

V q (M E ) = (1/ Pn ) £ ai(M E ) Ui vf, (4) 

i=l 

obtained by setting to all but the q largest singular values. 

Starting from the initial guess provided by the rank-q projection V q (M E ), the final step solves a 



minimization problem stated as the following 11711 : 



Given X £ R nx i, Y G M nx « with X T X = 1 and Y T Y = 1, define 

F(X,Y) = min T(X,Y,S), 



JF(X, Y,S) = l ( M m " (XSY T ) hj f . 



2 
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Values for X and Y are computed by minimizing F(X,Y). This consists of writing V q (M E ) = 
XqSqY^ and minimizing F(X, Y) locally with initial condition X = Xq and Y = Yq. This last step 
tries to get us as close as possible to the correct low rank matrix M. 

V. Position Reconstruction 

Even if we had a good estimate of D, how we would position the sensors is not a trivial question. 
Multidimensional scaling (MDS) is a technique used in finding the configuration of objects in a low 
dimensional space such that the measured pairwise distances are preserved. If all the pairwise distances 
are measured without error, then a naive application of MDS exactly recovers the configuration of sensors 



28]. 



Algorithm 2 Classical Metric MDS I12I1 . 



Input: Dimension 77, estimated squared distance matrix D 
Output: Estimated positions MDS^-D) 

1: Compute (-1/2)L£>L, where L = I„ - (l/n)l n l£; 

2: Compute the best rank-d approximation U^^Ul^ of (— 1/2)L.DL; 

3: Return MDS^-D) = U n Y, l r / 2 . 



There are various types of MDS techniques, but, throughout this paper, by MDS we refer to the 
classical metric MDS, which is defined as follows. Let L be an n x n symmetric matrix such that 

L = In - {l/n)\ n \l, (5) 

where l n £ W 1 is the all ones vector and I n is the nxn identity matrix. Let MDS^-D) denote the n x rj 
matrix returned by MDS when applied to the squared distance matrix D. The task is to embed n objects 
in a 7] dimensional space MP. In our case for instance, where we want to find the position of sensors on 
a two dimensional space, we have rj = 2. Then, in the equation, given the singular value decomposition 
(SVD) of a symmetric and positive semidefmite matrix (— 1/2)L.DL as (— 1/2)LZ)L = U"EU T , 

MDS„(£) = Ur,^ 2 , 

where U v denotes the n x r] left singular matrix corresponding to the r] largest singular values and S r/ 
denotes the r) x rj diagonal matrix with 77 largest singular values in the diagonal. This is also known 
as the MDSLOCALIZE algorithm in |4|]. Note that since the columns of U are orthogonal to l n by 
construction, it follow that 

L • MDS^-D) = MDS r? (-D). (6) 
It can be easily shown that when MDS is applied to the correct and complete squared distance matrix 



October 13, 2011 



DRAFT 



14 



without noise, the configuration of sensors are exactly recovered |4|l. This follows from 

1 - T 
- 2 hDh = tXX T L , (7) 

where X denotes the n x rj position matrix in which the i-th row corresponds to Xi, the i] dimensional 
position vector of sensor i. Note that we only get the configuration and not the absolute positions, in the 
sense that MDS r) (-D) is one version of infinitely many solutions that matches the distance measurements 
D. Intuitively, it is clear that the pairwise distances are invariant to a rigid transformation (a combination 
of rotation, reflection and translation) of the positions X, and therefore there are multiple instances of 
X that result in the same D. For future use, we introduce a formal definition of rigid transformation 
and related terms. 

Denote by 0(?7) the group of orthogonal i] x r] matrices. A set of sensor positions Y G M nx,? is a 
rigid transform of X, if there exists a 77-dimensional shift vector s and an orthogonal matrix Q G 0(r/) 
such that 

Y = XQ + t n s T . 

Y should be interpreted as a result of first rotating (and/or reflecting) sensors in position X by Q and 
then adding a shift by s. Similarly, when we say two position matrices X and Y are equal up to a rigid 
transformation, we mean that there exists a rotation Q and a shift s such that Y = XQ + l n s T . Also, we 
say a function f(X) is invariant under rigid transformation if and only if for all X and Y that are equal 
up to a rigid transformation, we have f(X) = f(Y). Under these definitions, it is clear that D is invariant 
under rigid transformation, since for all (i, j), Dij = ||ccj — xj\\ = \\(xiQ + s T ) — (xjQ + , for 
any Q G 0(77) and s G R r '. 

Let X denote annxij estimation for X with estimated position for sensor i in the i-th row. Then, 
we need to define a metric for the distance between the original position matrix X and the estimation 
X which is invariant under rigid transformation of X or X. 

The matrix L defined in © is a symmetric matrix with rank n — 1 which eliminates the contributions 
of the translation. More precisely, 

LX =L(X + ts T ), 
for all s G R v . We can show that L has the following properties. 
Lemma 2. y, 



1311 Let the matrix L be defined as in ©. Moreover, let X and X be two position 
matrices with dimension n x r\. Then, we can show that 
• hXX T h is invariant under rigid transformation. 



• hXX T h = hXX 1 L implies that X and X are equal up to a rigid transformation. 
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TABLE I 








Summary of Notation. 






Symbol 


Meaning 


Symbol 


Meaning 


n 


number of sensors 


D 


complete noiseless distance matrix 


ro 


radius of the circle from which the sensors deviate 


D 


squared distance matrix 


a/2 


maximum radial deviation from the circle 


D 


noisy distance matrix 


IE 

to 


piojeciion lino mauices wiui eiiLiies on uiuex sei Hi 
unknown time delay added to the ToF measurements 


fj 
Z 


esmiidieu squdieu instance mauix 
noise matrix 


do 

Pn 


distance mismatch caused by the unknown time delay 
probability of having random missing entries 


N E 
X 


observed matrix 
positions matrix 


D s 


radius of the circle defining structured missing entries 
distance matrix with observed entries on index set S 


X 


estimated positions matrix 



This naturally defines the following distance between X and X. 



LXX T L - LXX^L 



(8) 



d(X,X) = - 

n 

where \\-\\ F denotes the Frobenius norm. 

According to Lemma |2j this distance is invariant to rigid transformation of X and X. Furthermore, 
d(X, X) = implies that X and X are equal up to a rigid transformation. We later state our theoretical 
results in terms of the distance defined in ([8]). 

VI. Main results 

We saw that the OptSpace algorithm is not directly applicable to the squared distance matrix because 
of the unknown delay. Since A in Q is a full rank matrix, the matrix D © D = [df j] no longer has 
rank four. Moreover, since the measurements are noisy, one cannot hope for estimating the exact value 
for do- Therefore, in the following we will provide error bounds on the reconstruction of the positions 
assuming that the time delay (equivalently do) is known. Afterwards, a heuristic method is proposed to 
estimate the value of do. 

In Table U the set of important notations used in the sequel is summarized. 

Theorem 1. Assume n sensors are distributed independently and uniformly at random on a circular ring 
of width a with central radius ro as in Fig. [4] The resulting distance matrix D is corrupted by structured 
missing entries D s and measurement noise Z s . Further, the entries are missing randomly with probability 
p n . Let N E = Ve(D — D s + Z s ) denote the observed matrix. Define D as the squared distance matrix. 
Assume 5 n = 5ro \/log n/n and p n = p. Then, there exist constants C\ and C2, such that the output of 
OptSpace D achieves 

\We{Y 



n 



\D - D] F < d 



log n 



+ C 2 



pn 



(9) 



with probability larger than 1 — n 3 , provided that the right hand side is less than ai(D)/n. We have 



" h3 



2Z h D tr 
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Algorithm 3 Finding do- 



Input: Matrix N E ; 
Output: Estimate do; 
1: Construct the candidate set Q = {d^\ . . . , d M ^} containing discrete values for do. 



for k = 1 to M do 

Set N E = N E — d {k) A E ; 
Set Nf k) =Nf k) QN^; 

Apply OptSpace on iV£\ and call the output N^; 

Apply MDS and let X&i = MDS 2 (N {k) ); 
Find 

Jk) _ V- (Ak) , II ^ (fc) ^ (fc)|| 

end for 

Find do satisfying 



The above theorem, in great generality, holds for any noise matrix Z, deterministic or random. The 
above guarantees only hold 'up to numerical constants'. To see how good OptSpace is in practice, we 
need to run numerical experiments. For more results supporting the robustness of OptSpace, we refer 



to H18h. 



Corollary 1. Applying multidimensional scaling algorithm on D, the error on the resulting coordinates 
will be bounded as follows 

\\nl pn 
with probability larger than 1 — 1/n 3 . (The proof is given in Appendix IB]) 

In Algorithm [3 we propose a heuristic method for estimating the value of do along with completion 
of the squared distance matrix. 

In fact, this algorithm guarantees that after removing the effect of the time delay, we have found the best 
rank 4 approximation of the distance squared matrix. In other words, if we remove exactly the mismatch 
do, we will have an incomplete version of a rank 4 matrix and after reconstruction, the measured values 
will be close to the reconstructed ones. 

VII. Proof of Theorem [I] 

This section is dedicated to the proof of our main result. To do so we apply Theorem 1.2 of [^j] to the 
rank-4 matrix D and the observed matrix N E = Ve(D - D s + Z s ). 

First, we provide the definition of a crucial property of D which is called incoherence. Following 



the definition in 



, a rank-4 symmetric matrix D G W lXn is said to be ^-incoherent if the following 



conditions hold. Let UT>U T be the singular value decomposition of D. 
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AO. For all i & [n], we have Ylk=i ^ik — 4/V n - 

Al. For all i E [n], j G [re], we have \Dij/a\(D)\ < \/4/V n - 

The extra 1/re terms in the right hand side are due to the fact that, in this paper, we assume that the 
singular vectors are normalized to unit norm, whereas in [60 the singular vectors are normalized to have 
norm y/n. 

Theorem 1.2 of iQ] states that if a rank-4 matrix D is p-incoherent then the following is true with 
probability at least 1 — 1/n 3 . Let <Ji(D) be the ith singular value of D and k(D) = a\{D)/a 4 {D) 
be the condition number of D. Also, let D denote the estimation returned by OptSpace with input 
N E = Ve{D — D s + Y s ). Then, there exists numerical constants C\ and C 2 such that 

l M n ^ 11^(^)112 + 11^(^)112 nn 
— \\D - L) \\ F < Gi , (11) 

n pn 



provided that 



and 



np > C 2 u 2 K{Df\ogn , (12) 



Ci \\Ve{D 8 )\\2 + \\Ve{Y s )\\ 2 < 04(g) (13) 
pn ~ n 

First, using Lemma [3l we show that the bound in (TTTb gives the desired bound in the theorem. Then, it 
is enough to show that there exists a numerical constant N such that the conditions in (fT2l and (fT3l) are 
satisfied with high probability for n > N. 

Lemma 3. In the model defined in the previous section, n sensors are distributed independently and 
uniformly at random on a circular ring of width a with central radius r . Then, with probability larger 
than 1 — ra~ 3 , there exists a constant c such that 



\\Ve(D s )\\2 < c5 3 (r + a) 2 ( J^Ypn . (14) 



1? 



where Ve{') an d D s are defined as in ([3]). The proof of this lemma can be found in Appendix ICl 

Now, to show that (fT2l holds with high probability for n > Clogn/p for some constant C, we show 
that k < f K (ro, a) and u < / M (ro, a) with high probability, where f K and / M are independent of n. Recall 
that k(D) = a 1 {D)/a 4 {D). We have 

D<i j — j j tZ,-- j j j I j j tXj j j j Stic^ j 

= (r + pi) 2 + (r + pj) 2 - 2xfxj 

= 2r\ + (2r oPl + p 2 ) + (2r oPj + p 2 ) - 2xJ Xj , 
where p,i is distributed in such a way that we have uniform distribution over the circular band. Thus, one 
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can show that 



D = ASA T 



where 

£1,1 £1,2 2r pi + p\ 

Jo x n ,2 £n,2 2r p„ + p 2 

One can write S as 

S = UAU~ 1 , A = diag ( -2, -2, 



2 








J_ 




r 





-2 














-2 





j_ 











.''0 



ro 



It follows that cri(-D) < 



0i(AA J ) and 04(D) > min 2, 



ro 

Vl+rg-rp 



4 (AA J ). We can 



compute the expectation of this matrix over the distribution of node positions. Having uniform distribution 



of the sensors over the circular ring, we have for the probability distribution of p: 

P P (p) 



r + p a a 
for < p < — . 



r a 

Thus, the expectation of the matrix A T A is easily computed as 



E[A T A] 



nr. 














a 2 



nr ^ 





»(S+3r: 



f(r 2 + ^) 



Let the largest and smallest singular values of E[A T A] to be na max (ro,a) and na mm (ro,a). Using the 
fact that 0j(-) is a Lipschitz continuous function of its arguments, together with the Chernoff bound for 
large deviation of sums of i.i.d. random variables, we get 

P(0!(AA r ) > 2n0 max (r o ,a)) < e~ Cn , 

P(0!(AA T ) < (l/2)n0 max (r o ,a)) < e~ Cn ., 

P(0 4 (AA T ) < (l/2)na min (ro,a)) < e~ Cn 

for some constant C. Hence, with high probability, k(D) < ^"^(ro'a) = /«( r 0i a )- 

Now to bound p, note that with probability 1 the columns of A are linearly independent. Therefore, 
there exists a matrix B £ W xr such that A = VB T with V T V = I. The SVD of D then reads 
D = UY,U T with E = Q T B T SBQ and U = VQ for some orthogonal matrix Q. To show incoherence 



(15) 
(16) 
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property AO, we need to show that, for all i £ [n], 

12 / ^ 



Vi r < . 

n 



Since Vi = B 1 A { , we have ||Vi|| 2 < a 4 (B)~ 2 \\Ai\\ 2 < cr 4 (A)~ 2 || Aj|| 2 . Combined with ||Ai|| 2 = 
*o + (ro + Pi) 2 + (2r Pi + p 2 ) 2 < r 2 , + (r + a) 2 + (2r o + a 2 ) 2 and CCD, we have 

m 2<UM , (17) 
n 

with high probability, where /^(ro, o) = 2(rg + (r + a) 2 + (2r a + a 2 ) 2 ). 



To show incoherence property Al, we use \ Dij\ < (2ro+a) 2 and cri(D) > \n a m i n (ro, a) min ( 2 
from <[T5]>- Then, 

I Ail ^ g(r ,a 



< , (18) 



<Ji{D) ~ n 

with high probability, where giro, a) = m ax I 2, , 4r ° — I (2ro + a) 2 /<T m i n (ro,a). Combining (fT71 ) and 

\ v 1+r o-n) y 



181) . we see that the incoherence property is satisfied, with high probability. 



Further, (fT3l) holds, with high probability, if the right-hand side of (O is less than C3^^^^-cr max (ro, a 



since ^(Z?) < ^ra " v ^ 1 ^° -^-o'max(^0i Q )- This finishes the proof of Theorem [TJ 



VIII. Simulation Results 

In order to evaluate the performance of the calibration method, three sets of experiments are done. First, 
the distance matrix is assumed noiseless and the value of the do is set to zero. The position estimation 
error is derived for different values of n and the ring width a. The value of r is set to 10 cm, on average 
5 percent of entries are missing randomly, and S in Theorem [TJ is assumed to be 1 . For each value of a 
and n, the experiment is repeated 10 times, and the average is taken. The results are reported in Fig. [7] 
As expected from Corollary [TJ the greneral trend in all curves is that the error decreases as n grows. 
Moreover, the larger a is, the bigger is the reconstruction error, which is also coherent with the results 
of Corollary [TJ 

To examine the stability of the estimation algorithm under noise, we set the values of a to 1 cm, 5 to 
1, ro to 10 cm, to to zero, and the percentage of random missing entries to 5. We added to each entry of 
the distance matrix D a centred white Gaussian noise of different standard deviations. For each n and 
standard deviation of noise, the experiments are repeated 10 times and the average is taken. The results 
are depicted in Fig. [8]Q . As the variance of the noise increases, the position estimation error grows, but 
in general the error decreases for larger n. 

'There has been a slight mislabeling in the earlier version of this paper in 1 1 1 which is corrected in this paper. 
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Fig. 7. Error in position estimation in noiseless case for different values of a. As n increases, the reconstruction error tends 
to zero. The estimation error increases for larger values of a, which confirms the results of Lemma [5] 
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Fig. 8. Error in position estimation for the case with centered white Gaussian noise of different standard deviations, a. 



As we discussed in Section II-AI one might treat the calibration problem as a special case of the sensor 
localization problem. However, there is a duality between the calibration problem and the traditional 
sensor localization problems. We showed in Section JI] that in the calibration problem the local dis- 
tance/connectivity information is not available whereas most of the state-of-the-art algorithms for sensor 
localization are based on the local information. In order to compare the performance of these methods 
with the proposed methods, a set of simu 



ations are performed. We compared the localization results 
of our method to the ones of Mds-Map [12], SDP-based |14] and also SVD-RECONSTRUCT [4]. The 



position reconstruction error (defined in ([8])) versus the number of sensors, n for the methods is reported 
in Fig. [9] in a log-log scale. 

For the simulations, we set the values of a to 1 cm, 5 to 1, ro to 10 cm, to to zero, and the percentage 
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Fig. 9. Error in position estimation versus the number of sensors for different methods. 



of the random missing entries to 5. The distance measurements were corrupted with a white Gaussian 
noise of standard deviation 0.6 mm. For each method and each n, the experiment is performed 10 times 
for different positions and different noises, and the average error is taken. For the SDP-based method, we 



have used the algorithm presented in 111 411 and the code published by the same authors. For Mds-Map, 
we have estimated the shortest paths using Johnson's algorithm [29]. Finally for SVD-RECONSTRUCT, 
we used the algorithm in |4J]. In order to adapt the measurements with the assumptions of the method, we 
assumed that pij = 1 — 0.05 = 0.95 for the measured points (note that 0.05 is on average the probability 
of having a random missing entry) and jij = 0. 

As the results in Fig. [9] suggest, Mds-Map and SVD-RECONSTRUCT methods perform very bad 
compared to the other two methods. The poor performance of Mds-Map is for the fact that it highly 
relies on the presence of local distance information, whereas in our case, these measurements are in 
fact missing. This method is based on estimating the missing distance measurements with the shortest 
path between the two sensors. However, one can easily see that given faraway sensor distances, the 
shortest path is a very coarse approximation of the distance between close-by sensors, also note that as 
the simulation results show, there is no guarantee that the estimation error will decrease as n grows. 

For SVD-RECONSTRUCT, the unrealistic assumption that all the sensors have a non-zero probability 
of being connected causes the bad results of the method. In our case, the probability that the close-by 
sensors are connected is zero because of the structured missing entries. In fact, since py is high, one 
could see this method as simply applying the classical MDS on the incomplete distance matrix. The 
surprising observation about the performance of this method is that the estimation error does not change 
much with n. 
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(a) Incomplete distance squared matrix (b) Completed distance squared matrix 



Fig. 10. Input and output of OptSpace algorithm. |(a)| The incomplete distance squared matrix D, with 5 percent of entries 
randomly missing, to = and 5 n = 3cm. |(b)| The completed matrix with estimated to = 10/is. The modified OptSpace 

algorithm in this case can find the time mismatch correctly. 

In contrast to the two aforementioned algorithms, the SDP-based method performs very well for 
estimating the sensor positions and the reconstruction error is very close to the one of the proposed 
method. The reason is the fact that this method does not directly rely on the local distance information. 
In fact, the distance measurements are fed to the algorithm as the constraints of a convex optimization 
problem. However, as the number of sensors goes large, the number of measurements also grows and so 
does the number of constraints for the semi-definite program. This causes the method not to work for n 
larger than 150 in our case. The same limitation is also reported by the authors of the method. 

In summary, taking the computational cost and reconstruction accuracy of the algorithms into account, 
the proposed method performs significantly better. 

Moreover, to show the importance of calibration in an ultrasound scanning device, a simple simulation 
is also performed. If the ToF measurements correspond to the exact positions of sensors without time 
delay to, reconstruction of water will lead to a homogeneous region with values equal to the water sound 
speed, whereas wrong assumption on the sensor positions and to causes the inverse method to give 
incorrect values as the sound speed to compensate the effect of position mismatch. 

In a simple experiment, we simulated the reconstruction of water sound speed (co = 1500) using the 
ToF measurements. In the simulation, 200 sensors are distributed around a circle with radius ro = 10 
cm, and they deviate at most 5 mm from the circumference and the ToF measurements are added by 
to = 10//S. The incomplete distance matrix is shown in Fig. |10(a)| 

In order to complete the distance matrix and find the time delay at the same time, we used Algorithm [3] 
We forced the rank of D to 4. The value for to is found as 4/is which is exactly as set in the simulation. 
The output of OptSpace algorithm is the completed D matrix which is shown in Fig. |10(b)| 

Using the completed distance matrix and the MDS method, the positions are reconstructed and fed to 
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an inverse tomography algorithm to reconstruct water sound speed. The results of the reconstruction are 
shown in Fig. [TT] In the figure, the results for four reconstructions are presented. In Fig. 11(a) the ToF 



matrix is not complete, it contains the time delay to, and the positions are not calibrated. The dark gray 
ring is caused by the non-zero time delay in the ToF measurements. In Fig. |1 l(b)[ the time mismatch is 
resolved using the proposed algorithm, but the sensor positions are not calibrated and the ToF matrix is 
still not complete. This figure shows clearly that finding the unknown time delay improves significantly the 
reconstruction image. Figure |1 l(c)[ shows the reconstructed medium when the ToF matrix is completed 
and time mismatch is removed, but the sensor positions are not yet calibrated. From this figure, it is 
confirmed that accurate time-of-flights are necessary but not sufficient to have a good reconstruction of 
the inclosed object. Finally, Fig. |1 l(d)| shows the reconstruction when the positions are also calibrated. 
Notice the change in the dynamic range for the last case. 

IX. Conclusion and Future Work 

In this work we introduced a theoretical framework for calibration in circular ultrasound tomography 
devices. We proposed a novel calibration algorithm for which we provided theoretical bounds on the 
performance. We also tested our method through exhaustive simulations to demonstrate its functionality 
in practice. We compared the algorithm with some state-of-the-art centralized sensor localization methods 
and showed that our method outperforms those in estimating the correct sensor positions. 

Even though we introduced a recursive algorithm for finding the time-delay, we were not able to provide 
theoretical guarantees on its convergence. We mainly observed its convergence through simulations. This 
is still an interesting theoretical challenge and requires further work. We also believe that our approach 
can potentially be deployed beyond circular to other popular topologies with simple geometry. 
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Appendix 

A. Proof of Lemma [7J 

The proof for the general case where the sensors are not on a circle is provided in |4j]. In the circular 
case however, we have Dij = \\xi\\ 2 + ||a;j|| 2 — 2xfxj = 2r 2 — 2xfxj, where r is the circle radius. 
Thus, the squared distance matrix is decomposable to 



where 





D = 


VEV T , 










r xi,i £i >2 




2 








V = 




, s = 





-2 







T 3?n,l 3?n,2 










-2 



This finishes the proof. ■ 
B. Proof of Corollary \J} 

Note that in general (LXX T L — LXX T L) has rank at most 2d where d is the dimension of the space 
in which sensors are placed (in our case d = 2). Therefore, 



LXX T L - LXX^L 



< V2d 



LXX T L - LXX^L 



where we used the fact that for any matrix A of rank r we have \\A\\ F < y/r\\A\\ 2 . Furthermore, the 
spectral norm can be bounded in terms of D and D as follows. 



LXX T L - LXX 1 L 



(a) 

< 



LXX T L 



1, 



-LDL 



+ 



(b) 1 
< - 

~ 2 



UD - D)h 



2 
1 

2 + 2 



1 



-LDL — XX 1 



L(-D + D)L 



(19) 
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where in (a), we used the triangle inequality and ([6]), namely, hX = X. In (6), we used (0 and the 



fact that for any matrix A of rank d, 



LDL - XX 



< 



±LDL - A 



In particular, by setting 



A = |LZ)L the second term in ( fl9l) follows. Since L is a projection matrix we have ||L|| 2 = 1. Hence, 
from ( fl9l ) we can conclude that 



LXX T L - LXX 1 L 

This immediately leads to Corollary Q] ■ 
C. Proof of Lemma\3\ 

Note that by the definition of D s , we have \VE(D s )ij\ < b\ for all i and j. Define A as 

i if (i,j)e Ens, 

otherwise . 

We start from a simple realtionship between an elementwise bounded matrix and its operator norm. 

\\V E (D S )\\ 2 < Si max V \ Xi \ \ Vj \ A itj = S 2 JA\\ 2 . 
M\=\\y\\=^ 

The inequlity in (l20l ) follows from the fact that Ve{D s ) is elementwise bounded by 6 n . We can further 
bound the operator norm ||A||2, by applying the celebrated Gershgorin circle theorem to a symmetrized 
version of A. Define a symmetric matix A as 

1 if e EnS or e Ens, 

otherwise . 

Since < Ajj < Aij for all i and j, we have ||A||2 < ||A||2. Applying the Gershgorin circle theorem 
we get 

||A||2 < max |Ajj-| . 



< 



D-D 



«e \n 



Define random variables {Yi, . . . , Y n }, where Yi is the number of non-zero entries in the ith row of A. 
Then, 

||A||2 < m&xYi . 
ie[n] 

We need to show that Yi concentrates around its mean. Since Y^'s are binomial random variables, we 
can apply the Chernoff bound. Recall that G S if \\xi — Xj\\ < 5 n . By the definition of E, each 
sample is sampled with probability p. Then the probability that either or (j, i) is in E is 2p — p 2 . 

Each entry in the ith row of A is an independent Bernoulli random variable with probability of being 
one equal to q(2p — p 2 ), where q is the probability that a pair is in S. Thus, we have K[Yj] = q{2p—p 2 )n. 
In order to find the bounds on E[Yj], we need to bound q. Figure [12] shows the process for obtaining the 
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Fig. 12. The process for bounding the probability of a pair of sensors to fall in S. r 1 = t-q — a/2 and ri = t-q + a/2. 




bounds on q. 

f r2 2irr 

q = F{\x,i - Xj\ < 5 n ] = I — -s nrP2(r)dr , 

J ri TT(rt-r() 

where p 2 (r) = ^gjy. 
Upper Bound on A(r): 

Obviously the area A(r) can be bounded by what is shown in Fig. [13] Thus, we will have 

a 5 n 
sm — = — . 

2 r 
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Note that for < a < n, a/ir < sin a/2 < a/2. Hence, o/tt < 5 n /r < a/2. So, 

Mr)<^(rl-rl)< 6 £(rl-rl). 

Thus 

P2{r) ~ vr(rl-rf) = 2r ' 

2itr ^H-dr ^ n ^ n 



7r(r2 — rf) 2r r 2 + r\ 2tq 
Lower Bound on A(r): 

In order to find the lower bound, we consider the following two different situations: 1) S n < a and 2) 
5 n > a. 
Case 1 (5 n < a): 

In this case the minimum area of the intersection is achieved when the center of the circle is on the 



exterior boundary of the region as shown in Fig. 14(a) In this case, one can show that, 

7T<5 2 

A(r) > -f . (20) 

Case 2 (5 n > a): 

In this case, wherever the center of the circle is, it will have intersection with both bounding circles. 
Thus, the minimum area is achieved when the center of the circle is on the exterior boundary as in 
Fig. |14(b)[ where 



Thus, we will have 



Ayr) > (x 2 - xi) 



- « 2 )(4r 2 ~ gj + x/ggg ~ gj r\-r\ 
2r 2 2r 2 



yggrg - g) r 2 -r 2 _ /" l^ rf - r 2 

2^ 27T" n V (r2_ r J ^| _ 



If we assume that r 2 > -js5 n , which is a reasonable assumption according to the problem statement, we 



V2 

will have 



2r 2 ~ 2(r + a) 
Combining (1201) and (f2TT) . we can find the lower bound for A(r) as 

A(r) > min(j, ^ 2 )<5 2 = a7 "° 2 <5 2 . 

4 2(ro + a)^ 2(r + a) 2 
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(a) Lower bound in case 1. 



(b) Lower bound in case 2. 



Fig. 14. Evaluation of lower bound for A(r). In |(a)| we assume that S n < a whereas in |(b)| we take S n > a. In both cases the 
minimum intersection is achieved when the center of S n circle is on the exterior boundary of the region. 



Thus, 



2irr 



-P2(r)dr 



2irr 



A(r) 



-dr > 



n A r 2 ~ r l) Jn ^(rl - rf) vr(r| - rj) ~ 4vr(r + a) 2 ' 

From the above calculations, we have that ^^ +a y Pn n < JE[Y] < ^S n p n n. Applying the Chernoff 
bound to Yi, we have 

V< > (1 + a)E\Yi]) < 2 -( 1+a ) E ^. 



In other words 



(Yi > (l + a)-5 nPn n) < 2 
V r J 



TP™ n 



Applying the union bound, we get 



1 



max Y, > (1 + a)-S nPn n) < n 2^ l+a) ^^^ n < 
ie[n] r J 



By the assumption that 5 n p n = O(ro^/log 2 n/n), there exists constants c and N, such that b\p n > 
cr 2 log 2 n/n, for n > N. Define a positive parameter f3 such that 1 + 

4vr(l + 0) 



c(l+a)r„ -j^gjj W£ w jii [^yg 

47r(r +a) 2 



'( ma? 



maxK- > 



-(r + a) 2 5 n p n nj < n 13 . 



Finally with probability 1 — n 13 , 



ll^(i^| 2 <^^W> gn 



n 



pn = C(r + a) 2 5 3 1 



n 



pn . 



This finishes the proof of Lemma [3] 



October 13, 2011 



DRAFT 



